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1  Problem 

The  main  objective  of  this  project  is  to  establish  the  mathematical  foundation  and  develop  efficient 
numerical  algorithms  for  network  clustering  or  network  partition.  Network  clustering  is  a  rather  popular 
subject  in  network  science.  However,  with  very  few  exceptions,  most  work  in  this  area  focuses  on  specific 
algorithms  rather  than  the  mathematical  foundation  of  the  problem.  In  other  words,  they  discuss  directly 
“how  to  do  it”  before  formulating  clearly  “what  needs  to  be  done”.  This  kind  of  approach  is  fairly 
common  in  computer  science:  Much  attention  is  given  to  developing  algorithms  rather  than  formulating 
clearly  the  mathematical  problem. 

Given  a  complex  network,  it  is  of  great  interest  to  reduce  the  network  to  a  much  smaller  one.  In  the 
context  of  Markov  chains,  this  issue  is  closely  related  to  the  issue  of  lumpability  of  a  Markov  chain:  If 
a  Markov  chain  is  lumpable,  then  the  lumped  chain  can  be  naturally  considered  as  the  reduction  of  the 
original  Markov  chain. 

Another  interesting  idea  that  we  have  proposed  is  to  endow  complex  networks  with  a  landscape. 
Landscape  has  been  a  very  useful  concept  for  continuous  systems  arising  in  physics,  chemistry,  biology, 
etc.  Building  upon  the  Markov  chain  structure  in  networks,  we  have  introduced  a  way  of  endowing  a 
landscape  to  a  network.  This  has  proved  to  be  very  useful  in  several  different  examples  arising  in  social 
and  biological  networks. 

2  Approach 

Our  basic  starting  point  is  to  identify  networks  with  Markov  chains  so  that  the  issue  of  network  clustering 
becomes  the  issue  of  approximate  lumpability  of  a  Markov  chain.  The  latter  problem  is  treated  in  the 
following  way.  For  each  possible  reduced  chain  on  a  partition  of  the  state  space  of  the  original  chain, 
we  can  associate  a  Markov  chain  on  the  original  state  space,  which  can  be  considered  as  the  lift  of  the 
reduced  chain  to  the  original  state  space.  This  is  done  with  the  help  of  the  invariant  measure  of  the 
chain.  The  Hilbert-Schmidt  norm  of  the  difference  between  the  transition  matrices  of  the  original  chain 
and  the  lifted  chain  is  a  measure  of  the  accuracy  of  the  reduced  chain.  The  optimal  reduced  chain  can 
be  obtained  by  minimizing  this  difference. 

Questions  that  we  need  to  address  include: 

1.  How  should  we  decide  how  big  the  reduced  chain  should  be,  i.e.  the  size  of  the  partition?  In  this 
context,  this  is  the  problem  of  “model  selection”. 

2.  Often  times  it  is  not  a  good  idea  to  associate  each  node  to  one  and  only  one  of  the  clusters.  It 
is  more  appropriate  to  associate  a  probability  distribution  for  a  given  node  to  belong  to  a  certain 
cluster. 
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3.  A  real  network  has  its  physical  meaning.  How  do  we  attach  a  physical  meaning  to  the  reduced 
network? 

In  the  last  decade  or  so,  network  has  emerged  as  the  appropriate  tool  for  modeling  social,  economic, 
biological  and  physical  interactions  in  a  wide  range  of  disciplines.  These  networks  also  have  many 
interesting  and  complex  structural  and/or  dynamic  properties  which  require  new  understanding.  With 
very  few  exceptions,  the  work  done  in  network  science  is  descriptive  in  nature  rather  than  deductive. 
Many  results  are  empirical  or  observational  facts  rather  than  consequences  of  fundamental  principles. 
Much  work  has  to  be  done  in  order  to  establish  the  mathematical  foundation  of  network  science.  The 
present  project  is  a  step  in  that  direction. 

One  can  make  a  comparison  with  image  processing.  Image  processing  has  been  a  subject  in  computer 
science  for  a  long  time.  Many  techniques  have  been  developed.  However,  these  techniques  do  not  address 
the  question:  What  do  we  want  to  achieve  with  these  techniques?  They  simply  produce  some  results 
without  explaining  the  rationale  behind  the  techniques. 

The  work  of  Mumford-Shah,  Osher  and  co-workers  changed  that.  Instead  of  going  directly  to  the 
algorithms,  one  first  formulates  the  objective  in  terms  of  a  mathematical  problem.  Algorithms  are  then 
developed  to  solve  the  mathematical  problem.  In  this  way,  one  has  a  much  better  idea  of  what  one  is 
trying  to  do. 


3  Results 

So  far  our  accomplishments  are  as  follows: 

•  We  have  formulated  the  mathematical  model  for  network  clustering. 

•  We  have  formulated  a  mathematical  framework  for  probablistic  clustering,  which  allows  each  node 
to  have  a  certain  probability  of  belonging  to  a  cluster. 

•  We  have  developed  very  efficient  numerical  algorithms  for  both  deterministic  and  probabilistic 
clustering. 

•  We  have  tested  the  algorithms  on  realistic  network  models. 

We  have  also  introduced  the  concept  of  diffusion  complex.  This  is  analogous  to  the  Witten  complex  in 
quantum  mechanics  in  the  context  of  diffusion  process.  We  are  now  working  on  extending  this  concept  to 
networks.  As  a  first  step,  we  have  introduced  the  concept  of  landscapes  for  complex  networks.  From  the 
invariant  distribution  of  the  Markov  chain,  one  can  introduce  an  analog  of  “  free  energy”  for  networks. 
One  can  then  define  focal  minima  and  local  maxima,  as  well  as  saddle  points  for  the  free  energy  on  the 
network.  We  have  tested  these  concepts  on  some  examples  from  social  and  biological  networks.  The 
results  are  very  promising:  They  reveal  important  information  such  as  “metastable  saddles”  for  protein 
structure  network. 


4  Publications 

•  ELiV08  W.  E,  T.  Li  and  E.  Vanden-Eijnden,  “Optimal  partition  and  effective  dynamics  of  complex 
networks,”  Proc.  Natl.  Acad.  Sci.  USA,  vol.  105,  pp.  7907-7912,  2008. 

•  Tiejun  Li,  Jian  Liu  and  Weinan  E,  “A  probablistic  framework  for  network  partition”,  Phys.  Rev. 
E,  vol.  80,  no.  2,  026106,  2009. 
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Figure  1:  The  deterministic  partition  of  the  famous  Karate  club  network.  This  result  agrees  with  the 
result  of  Zachary’s  well-known  work. 

•  Weinan  E  and  Tiejin  Li,  “A  unified  mathematical  framework  for  network  clustering” ,  in  prepara¬ 
tion. 

•  Weinan  E  and  Jianfeng  Lu,  “Diffusion  complexes”,  preprint,  submitted  to  Comm.  Pure  Appl. 
Math. 

•  Weinan  E,  Jianfeng  Lu  and  Yuan  Yao,  “The  landscape  of  complex  networks”,  preprint. 

5  Student/postdoc  supported 

Students  supported: 

Xiang  Zhou,  date  of  completion  of  Ph.  D.  2009. 

Hao  Siren,  in  progress. 

Post-docs  supported: 

Xiang  Zhou,  September  2009  to  August  2011. 
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•  Awarded  SIAM  Ralph  Kleinman  Prize. 

Citation:  For  his  extraordinary  interdisciplinary  contributions  and  for  his  exemplary  record  in 
mentoring  students  and  postdocs.  He  has  had  profound  impacts  on  research  in  stochastic  par¬ 
tial  differential  equations  and  turbulence,  numerical  solution  of  multiscale  problems,  dynamics  of 
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Figure  2:  The  gray  scale  plot  of  px  and  pw  (which  are  probabilities  for  belonging  to  the  two  clusters 
respectively)  for  each  node  in  karate  club  network.  The  darker  the  color,  the  larger  the  value  of  px-  The 
transition  nodes  or  intermediate  nodes  are  clearly  shown. 
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interacting  dislocations,  liquid  crystals  and  polymers,  metastability,  protein  folding,  gas  dynam¬ 
ics,  epitaxial  growth,  micromagnetics,  and  superconductivity.  His  vision  and  breadth  are  truly 
remarkable. 

•  Elected  to  the  inaugral  class  of  SIAM  fellows. 

7  Conclusions 

•  Complex  networks  is  an  important  subject  in  many  disciplines,  particularly  in  social  science. 

•  Markov  chains  provide  a  natural  mathematical  framework  for  studying  the  structure  and  dynamics 
of  complex  networks. 

•  Network  partition  (clustering)  has  been  a  popular  subject  in  computer  science,  but  it  really  needs 
a  solid  mathematical  treatment. 

•  We  have  developed  the  mathematical  framework  and  related  algorithms  for  network  partition. 

8  Future  plans 

We  intend  to  continue  our  study  of  the  dynamics  of  complex  networks,  in  the  same  style.  In  particular, 

we  will  investigate  the  following  problems: 

1.  Diffusion  on  complex  networks.  Study  how  the  diffusion  process  depends  on  network  structure. 

2.  Stochastic  games  on  complex  networks.  So  far,  game  theory  has  only  been  studied  on  relatively 
simple  networks.  In  the  same  spirit  as  above,  we  will  study  how  the  network  structure  affects 
games  on  these  networks. 

3.  Physical  significance.  It  is  important  to  relate  the  mathematical  results  back  to  the  practical 
situation  in  which  the  networks  come  about.  This  requires  a  deep  understanding  of  the  original 
problem,  as  well  as  the  mathematical  analysis  of  these  networks. 


5 


